2024 iThome 鐵人賽

DAY 10

Python

Python和R入門語法比較系列第 10 篇

03 more about csv in Python and R [16th 鐵人 Day 10]

16th鐵人賽

carplee

團隊為你抓鯉魚

2024-09-23 10:28:37

112 瀏覽

分享至

下載song_rank.csv

import pandas as pd

with open('data/song_rank.csv') as f:
    p = pd.read_csv(f)

Python

看整個表格長怎樣

.shape
.info()
.dtypes
.describe( )

# 1. 列數 欄數 .shape
p.shape

    (10, 7)

# 2. 簡要資訊 .info( )
表格結構: 類型,筆數,欄數,各欄型態

p.info()

    <class 'pandas.core.frame.DataFrame'>
    RangeIndex: 10 entries, 0 to 9
    Data columns (total 7 columns):
    Rank      10 non-null int64
    Hits      10 non-null int64
    Song      10 non-null object
    Co        10 non-null object
    Artist    10 non-null object
    Date      10 non-null object
    Url       10 non-null object
    dtypes: int64(2), object(5)
    memory usage: 688.0+ bytes

# 3. 欄位型態 .dtypes

p.dtypes

    Rank       int64
    Hits       int64
    Song      object
    Co        object
    Artist    object
    Date      object
    Url       object
    dtype: object

# 4. describe( ) 統計資料(only for數值)
個數, 平均數,標準差, min,第1分位,中位數,第3分位,max

p.describe()

看每一欄長怎樣

p.columns

    Index(['Rank', 'Hits', 'Song', 'Co', 'Artist', 'Date', 'Url'], dtype='object')

p.Rank #p['Rank']

    0     1
    1     2
    2     3
    3     4
    4     5
    5     6
    6     7
    7     8
    8     9
    9    10
    Name: Rank, dtype: int64

type(p.Rank)

    pandas.core.series.Series

p.Artist

    0                       五月天 阿信
    1                     魏嘉瑩, 魏如昀
    2    陳芳語 (Kimberley Chen), 茄子蛋
    3                      蕭敬騰, 馬佳
    4                吳汶芳 (Fang Wu)
    5                 琳誼 Ring, 許富凱
    6            張語噥 (Sammy Chang)
    7                      Ray 黃霆睿
    8                飛兒樂團 (F.I.R.)
    9                      摩登兄弟劉宇寧
    Name: Artist, dtype: object

type(p.Artist)

    pandas.core.series.Series

Series

dtypes:

int
object

R

getwd()
setwd('/Users/carplee/Desktop/untitled folder/')

r = read.csv('data/song_rank.csv')
##### 看整個表格長怎樣 ####
#str()表格結構:類型,筆數,欄數,各欄型態,值
str(r) #base

#summary()統計資料:Min,第1分位,中位數,平均數,第3分位,Max
summary(r)

#### 看每一欄長怎樣 ####
colnames(r)
r$Rank
class(r$Rank)
# [1]  1  2  3  4  5  6  7  8  9 10
# > class(r$Rank)
# [1] "integer"

#### 數值向量 ####

內容預告：

04 Python: pandas Series 數值資料 v R: 數值向量

05 Python: Pandas Series 字串資料 v. R:文字向量

06 日期 in Python and R

07 Python 和 R 的字串處理

08 [R] 用Regular Expression(正規表示法)處理文字

02-2 Python的read...和 R語法 #讀檔

04 Python: pandas Series 數值資料 v R: 數值向量 [16th 鐵人 Day 11]

系列文

Python和R入門語法比較共 30 篇

RSS系列文訂閱系列文

1 人訂閱

完整目錄

直播研討會

{{ item.channelVendor }} {{ item.webinarstarted }} |

直播中

尚未有邦友留言

立即登入留言

參賽組數

1064 組

團體組數

40 組

累計文章數

22211 篇

完賽人數

600 人

15th鐵人賽 16th鐵人賽 13th鐵人賽 14th鐵人賽 12th鐵人賽 11th鐵人賽鐵人賽 2019鐵人賽 javascript 2018鐵人賽 python 2017鐵人賽 windows php c# windows server linux css react vue.js

IT邦幫忙

Python和R入門語法比較系列 第 10 篇

03 more about csv in Python and R [16th 鐵人 Day 10]

Python

看整個表格長怎樣

看每一欄長怎樣

Series

R

內容預告：

04 Python: pandas Series 數值資料 v R: 數值向量

05 Python: Pandas Series 字串資料 v. R:文字向量

06 日期 in Python and R

07 Python 和 R 的 字串處理

08 [R] 用Regular Expression(正規表示法)處理文字

尚未有邦友留言

標記使用者

Python和R入門語法比較系列第 10 篇

07 Python 和 R 的字串處理